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Epistemological  Relevance  and  Statistical  Knowledge 
1.  Background. 

For  many  years,  at  lean  since  McCarthy  and  Hayes 
(1969.',  venters  have  lamented,  and  attempted  to  compensate 
tor,  the  alleged  fact  that  we  often  do  not  have  adequate 
statistical  knowledge  for  governing  the  uncertainty  of  belief,  fori 
making  uncertain  inferences,  and  the  like.  It  is  hardly  ever 
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had  it,  and  how  adequate  statistical  knowledge  could  be  used  to 
control  and  regulate  epistemic  uncertainty. 


One  response  to  the  lack  of  adequate  statistics  has  been  to  / 
search  for  non-statistical  measures  of  uncertainty.  The  minimal 
variant  has  been  to  propose  “subjective  probability"  as  a  concept 
to  which  we  can  turn  when  we  lack  statistics. 

This  proposal  comes  in  widely  differing  flavors,  based  on 
the  dreadful  ambiguity  of  "subjective".  Sometimes  this  means 
merely  "indexed  by  a  subject".  In  this  sense  there  is  no  conflict 
with  statistical  representations:  the  "subjectivity"  involved  just 
represents  the  fact  that  statistical  knowledge  is  related  to  a 
knower.  (This  appears  to  be  the  sense  of  "subjective"  employed 
by  Cheeseman  (1987).) 

At  the  other  extreme,  "subjective"  may  mean  "arbitrary, " 
"whimsical,"  "subject  to  no  objective  control  or  constrtaint. " 

Those  who  think  we  must  turn  in  this  direction  are  influenced  by 
the  feeling  that  in  many  cases  there  may  be  nothing  better  to 
turn  to 


Other  proposals  concern  non-probabilistic  measures  of 
uncertainty:  the  certainty  factors  of  Mycin  (1984),  the  belief 
functions  of  Shafer  (1987),  the  fuzzy  membership  relation  of 
Zadeh  (1986). 

Our  purpose  here  is  not  to  evaluate  these  alternative 
treatments  of  uncertainty,  but  rather  to  explore  the  question  of 
how  far  you  can  go  on  the  basis  of  statistical  knowledge  that  you 
do  have,  and  what  considerations  must  be  taken  account  of  in 
this  attempt  Relatively  few  people  have  explored  the  question  of 
how  far  you  can  go  using  statistical  knowledge.  One  writer  who 
has  taken  this  question  seriously  is  Bacchus  (1988) . 

A  second  question,  in  fact  the  one  that  McCarthy  and 
Hayes  had  in  mind,  .s  the  question  of  using  statistical  knowledge 
to  provide  an  underpinning  for  uncertain  inference  --  that  is, 
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presupposition  of  the  non-monotomc  and  default  indus*  r:e 
to  be  that  you  cannot  very  often  base  such  inferences  on 
statistical  knowledge  In  part  this  presupposition  is  bas : d 
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feeling  that  'typicality"  and  "frequency"  mean  different  things 
Be  that  as  it  may,  in  formalizing  non-monotomc  logics  many 
people  seem  to  be  led  to  considerations  that  (not  surprisingly! ) 
mirror  considerations  appropriate  to  the  application  of  statistical 
knowledge. 

Thus  Ethenngton  (1987)  introduces  the  concept  of 
preference  among  models,  Konolige  (1987)  defines  a  notion  of 
minimal  extension:  Tcuretzky  (1986)  gives  a  metric  for 
inferential  distance. 

These  metrics  will  be  reflected  in  one  of  three  principles 
governing  the  relvance  of  statistical  knowledge  to  be  discussed 
below.  Our  analysis  of  the  ground-rules  for  the  use  of  statistical 
knowledge  will  throw  light  on  the  “cancellation  principles"  of 
non-monotonic  logic  as  well. 


2.  Assumptions. 

The  assumptions  we  make  here  are  relatively  few.  We 
suppose  that  the  knowledge  base  may  have  statistical  knowledge 
in  it.  This  statistical  knowledge  may  be  construed  in  a  number 
of  ways  —  for  example  as  statements  concerning  chances  or 
statements  concerning  frequencies  in  an  arbitrarily  long  run,  or 
statements  concerning  frequencies  summed  over  possible  worlds. 
We  do  suppose  that  these  statements  are  general .  that  is  that 
they  do  not  represent  the  fact  that  we  have  recorded  a 
frequency  in  a  specific  sample.  We  mav  have  done  so;  but  we 
also  may  have  gotten  our  statistical  knowledge  from  a  handbook, 
or  a  dependable  colleague.  In  any  event,  the  statistical 
knowledge  in  our  knowledge  base  is  taken  to  be  general  scientific 
knowledge  relating  properties,  notationally,  we  will  write  "%{R,  A) 
=  p  "  to  represent  the  fact  that  the  long-run  frequency  A 's 
among  R 's  is  p. 

Our  second  assumption  is  that  statements  fall  into 
equivalence  classes  with  respect  to  the  statistical  information  that 
is  epistemically  most  relevant  to  them  It  is  sometimes  said  that 
we  can't  use  statistical  knowledge  to  determine  the  chance  of 
heads  on  the  toss  of  a  coin  that  is  to  be  tossed  once  and  then 
destroyed,  on  the  ground  that  we  have  no  statistics  about  that 
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will  lead  to  different  reference  classes  Similarly,  if 
we  know  *hat  this  sample  of  3 's  exhibits  a  relative  frequency  of 
C 's  of  45,  then  the  chance  that  the  long  run  frequency  is  close 
to  the  frequency  m  the  sample  is  the  same  as  the  chance  that 
the  long  run  frequency  is  close  to  .45. 

We  can  express  this  as  a  formal  principle:  If  "  S  —  T"  is 
in  our  knowledge  base,  then  the  same  statistical  knowledge  is 
potentially  relevant  to  S’  and  to  T. 

Our  third  assumption  is  the  general  one  that  our 
knowledge  base  can  be  expressed  in  a  first  order  extensional 
language  (Of  course  this  requires  the  inclusion  of  enough  first 
order  set  theory  to  accommodate  the  statistics!)  We  take  an 
individual,  however,  to  be  arbitrarily  complex:  for  example  it 
might  be  a  trial  of  a  complicated  compound  experiment 

Finally,  in  order  for  statistics  to  be  of  interest,  we  suppose 
that  we  may  know  some  things  about  an  individual  without 
knowing  everything  about  it.  Thus  we  might  know  of  "the  next 
trial"  that  it  is  a  trial  consisting  of  selecting  one  of  a  number  of 
urns  at  random,  and  then  selecting  a  coin  at  random  from  the 
urn,  and  then  tossing  the  coin  10  times.  And  then  we  might  be 
interested  in  whether  the  tenth  toss  landed  heads  on  that  trial, 
or  v/e  might  know  the  distribution  of  heads  in  the  tosses,  and  we 
might  be  interested  in  whether  the  urn  was  urn  number  4,  or 
we  might  be  interested  in  knowing  something  about  the  frequency 
of  two  headed  coins  in  the  urn  from  which  we  got  our  sample. 


3.  Interference  I. 

We  will  be  concerned  with  the  way  in  which  some  items 
of  statistical  knowledge  can  interfere  with  the  epistemic  relevance 
of  other  items  The  simplest  and  clearest  cases  can  be  borrowed 
from  non-monotonic  logic. 

If  all  we  know  of  Tweety  is  that  she  is  a  bird,  it  is 
reasonable  to  believe  that  she  can  fly.  If  we  also  know  that  she 
is  a  penguin,  then  it  is  reasonable  to  believe  that  she  cannot  fly, 
since  our  knowledge  about  the  chances  of  a  penguin  flying 
interferes  with  our  knowledge  about  the  chances  of  a  bird  flying 
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a  rare  kind  of  penguin  that  can  fly,  and  if  we  know  that  Twcetv 
is  one  of  them,  then  this  new  knowledge  interiors  with  our 
general  knowledge  about  penguins,  and  again  we  may  suppose 
that  Tweety  can  fly 

This  relation  has  been  noted  by  Etherington,  Pools, 

Konolige,  and  others  It  corresponds  to  v/hat  Reichenbach  (1949) 
had  in  mind  when  he  said  that  we  should  base  our  posits 
(degrees  of  belief)  on  the  "narrowest"  reference  class  concerning 
which  we  have  adequate  statistics.  ("Having  adequate  statistics" 
does  not  mean  having  knowledge  of  a  sample  of  the  class  in 
question,  it  means  having  useful  general  knowledge  about  that 
class,  whatever  it  may  be  based  on.) 

A  principle  embodying  this  natural  constraint  must  be 
stated  with  somewhat  more  generality  than  is  at  first  obvious, 
however 

Suppose  (to  move  to  an  artificial  example)  we  know  of  a 
certain  bail  that  it  is  a  ball  in  certain  room,  and  that  we  know 
that  fifty  percent  of  the  balls  in  that  room  are  black.  (A  natural 
way  of  designating  it  is  by  means  of  the  definite  description  "the 
ball  next  to  be  chosen."  We  could  also  take  the  individual 
concerned  to  be  a  choosing  of  a  ball;  the  latter  would  be  natural 
if  we  were  to  consider  repeated  samplings  from  the  room.) 
Suppose  we  know  also  that  that  particular  ball  is  also  one  in  an 
urn,  in  which  75%  are  black.  The  second  piece  of  statistical 
knowledge  is  clearly  epistemically  relevant  and  the  first  is  not 
This  intuition  is  based  on  the  fact  that  the  set  of  balls  m  the  urn 
in  the  room  is  a  subset  of  the  set  of  balls  in  the  room 

But  how  about  the  one-membered  subsets  of  the  power 
set  of  balls  in  the  room?  They  are  abstract  objects,  and  so  can't 
be  black,  but  they  can  have  the  property  of  having  a  black 
member  And  the  ball  in  question  is  not  a  member  of  this 
possible  reference  set,  but  of  course  its  unit  set  is.  And  its  unit 
set  will  have  a  black  member  if  and  only  if  the  ball  in  question 
is  black  So  what0  Well,  the  set  of  balls  in  the  urn  is  not 
(cannot  be)  a  subset  of  the  set  of  one  membered  subsets  of  the 
set  of  balls  in  the  room. 

We  could  stipulate  that  all  the  sentences  in  question  have 
some  specific  canonical  form;  but  we  shall  see  shortly  that  that 
is  not  such  a  good  idea  What  we  can  do  instead  is  to  formulate 
our  principle  a  bit  more  broadly 
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The  subset  principle  is  one  that  has  been  frequently 
identified  in  the  context  of  non-monotonic  logic. 

4.  Interference  II. 

Here  is  an  example  that  calls  for  a  second  principle:  As 
before,  suppose  we  have  a  roomful  of  urns,  and  that  a  designates 
a  ball  in  the  room  Suppose  we  know  that  there  are  100  balls  m 
the  room,  and  that  50  are  black.  But  suppose  we  also  know 
that  there  are  10  urns,  that  9  of  them  containing  four  black 
balls  and  one  white  ball,  and  that  the  tenth  contains  the 
remainder  of  the  balls.  The  relative  frequency  of  black  balls  in 
the  first  nine  urns  is  .8,  and  the  relative  frequency  of  black  balls 
in  the  tenth  urn  is  14/45  =  .311.  .  . 

Let  us  consider  what  statistics  are  relevant  to  the 
statement,  "  a  is  black."  If  we  know  of  a  only  that  it  is  a  ball  in 
the  room,  it  is  only  the  statistics  about  the  frequency  of  black 
balls  in  the  room  that  are  relevant.  If  we  know  also  something 
about  how  a  came  to  be  the  designated  ball,  the  other  statistics 
may  also  be  relevant.  For  example,  we  might  know  that  a  is 
the  ball  resulting  from  first  choosing  an  urn  at  random,  and 
then  choosing  a  ball  at  random  from  the  chosen  urn.  If  that  is 
the  case,  the  relevant  statistics  are  those  governing  the 
proportion  of  pairs  consisting  of  an  urn,  and  a  ball  drawn  from 
that  urn,  such  that  the  second  member  of  the  pair  is  black  We 
can  easily  calculate  the  proportion  of  pairs  having  this  property 
to  be  .9  *  8  +  .1  *  .31. ..  =  .751. . . 

But  why,  under  these  circumstances,  should  we  regard 
the  statistics  concerning  balls  in  the  room  to  be  epistermcally 
irrelevant0  The  interfering  set  isn't  a  subset  of  its  competitor.  • 
(Note  that  .751  .  cannot,  mathematically,  be  the  relative 
frequency  in  any  subset  of  the  st  of  balls  in  the  room1) 

But  we  can  find  a  relationship  there  is  a  possible 
reference  class  that  matches  the  competitor,  of  which  the  correct 
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reference  set  is  a  subset  —  namely,  the  cross  product.  ::  the  set 
of  urns  and  the  set  of  balls.  This  construction  is  parti -marly 
important  in  the  context  of  (so-called)  Bayesian  inference,  the 
model  we  just  looked  at  corresponds  to  a  non-sampling  case  in 
which  we  have  a  prior  probability  of  .  9  combined  with  a 
conditional  probability  of  .8,  and  a  prior  probability  of  1 
combined  with  a  conditional  probability  of  311  We  therefore 
call  the  rule  the  Bayesian  Principle: 

The  Bavesian  Principle:  Suppose  that  "  <a,  b>  is  a  B  "  is  in  our 
knowledge  base,  and  that  =  p  "  is  m  our  knowledge 

base.  Suppose  that  we  know  that  a  is  a  €'  if  and  only  if  a  is 
a  C,  that  a  is  a  B',  and  that  % (B',  C' )  =  p'  *  p  This 
statistical  knowledge  is  epistemically  irrelevant  if  we  know  of  a 
cross  product  of  B‘  with  B"  and  a  corresponding  subset  C"  and 
a  "  such  that 

(1)  <a  a  ">  is  known  to  be  in  B  '  X  B  ", 

(2)  <a  a  ">  is  in  C“  if  and  only  if  a  is  in  3", 

(3)  %(B'  X  B\  C")  =  p 

and  for  some  B  *  known  to  be  a  subset  of  B'  x  B ", 

(4)  %{B*,C)  =  p 

To  see  how  this  works  in  our  illustrative  example,  let  U be 
the  set  of  urns,  B  the  set  of  balls,  E the  set  of  pairs 
corresponding  to  the  experimental  set-up,  with  <a;  v>  m  f  just 
in  case  x  is  an  urn  and  yis  a  ball  in  that  urn.  Our  target 
property  is  the  set  of  pairs  £7 in  which  the  second  member  is 
black.  The  proportion  is  just  what  we  calculated  before  .751.  .  . 

To  show  that  a  ' ,  and  the  statistical  knowledge  that  the 
proportion  of  balls  in  B  that  are  black  is  .50,  is  not  epistemically 
relevant,  we  observe  that  <a',u>  is  known  to  be  a  member  of 
B  x  f(  where  u  is  the  (unknown)  urn  selected,  that  the 
proportion  of  B  X  U in  C is  0.50,  but  that  there  is  a  subset  of  B 
X  U —  namely  ^itself  —  in  which  the  proportion  of  members  of 
cT is  the  same  as  that  in  B 

The  Bayesian  principle  is  followed  in  constructing 
representations  of  uncertainty,  particularly  in  cases  in  which 
uncertainties  are  modified  by  new  evidence,  but  I  have  not 
noticed  it.  in  discussions  of  non-monotonic  inference.  It  should 
be,  of  course 

Almost  all  (species  of)  mammals  give  birth  to  thc.r  young 
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will  give  birth  to  its  young  live,  since  almost  all  species  of 
mammals  are  such  that  when  their  females  reproduce,  they  do 
it  that  way.  (Almost  all  the  reproductive  balls  in  almost  all  the 
urns  are  white,  though  it  is  not  the  case  that  almost  all  the  balls 
m  an  urn  are  reproductive . }  V/e  accommodate  the  eve-viviparous 
platypus  by  noting  that  its  species  (its  urn)  is  unusual 


5.  Interference  III. 

The  final  principle  of  relevance  we  need  for  dealing  with 
statistical  knowledge  is  in  a  sense  the  dual  of  our  first  principle, 
the  subset  principle.  Suppose  that  you  are  sampling  from  a 
population  /’with  a  view  to  making  an  inference  about  the 
proportion  of  3 's  there  are  in  P  it  is  a  general  set  theoretical 
fact  that  we  will  not  explore  more  deeply  that  almost  all  subsets 
of  a  given  set  reflect  within  narrow  limits  the  composition  of  the 
parent  set 

Putting  flesh  on  this  observation,  we  might  note  that 
(using  a  crude  approximation),  whatever  the  proportion  of  P's 
that  are  B' s,  the  proportion  of  10,000  member  subsets  of  /Mhat 
have  a  proportion  of  B 's  within  04  of  the  actual  proportion  is  at 
least  .975 

Suppose  you  look  at  10,000  P's  and  find  that  5000  of 
them  are  B's.  Quite  clearly,  at  a  level  of  confidence  of  975, 
one  ought  to  suppose  that  between  .46  and  .54  of  the  P's  are 
B's. 

Of  course  we  might  have  various  bits  of  knov/ledge  that 
are  relevant  to  this  fact  that  fall  under  the  first  two  categories 
For  example,  it  may  be  that  we  know  that  our  sample  is  not  a 
random  one,  because  we  know  that  it  was  drawn  in  a  special 
way  that  produces  representative  samples  only  rarely.  Or  we 
may  know  that  we  are  sampling  from  a  collection  of  populations 
in  which  we  know  something  about  the  distribution  of  the 
relative  frequency  of  B's 

But  let  us  assume  that  neither  of  these  are  the  case  -- 
that  is,  that  neither  the  subset  principle  nor  the  Bayesian 
principle  apply  So  we  may  say  that  the  chances  are  at  least 
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But  now  note:  We  also  have  observed  a  subset  5000,  oi 
which  100%  were  B  's  So  why  do  we  not  infer  (by  a  precisely 
parallel  argument)  that  the  chances  are  nearly  0.0  that  the 
proportion  of  S' s  is  between  .76  and  .57°  It  is  because  the 
larger  sample  is  epistemically  relevant  relative  to  the  smaller 
one,  while  the  smaller  sample  is  not  relevant  relative  t;  the 
larger  one.  A  principle  that  captures  this  intuition  is 


The  Supersample  Principle  Suppose  that  we  know  that  a  n  is  a 

member  of  F*1  and  that  we  are  interested  in  the  chance  that  an 

is  <2  (e  g.,  "representative  within  e") .  Suppose  that  ^  is  known 
to  be  a  member  of  R,  that  a  is  a  O'  if  and  only  if  a n  is  Q,  and 

that  %{Rn,  O)  =  p  #  p  '  =  %{R,  O')  are  all  known  Then  our 
statistical  knowledge  about  R  is  epistemically  irrelevant  if  there  is 
a  parallel  structure  to  our  original  one  that  is  such  that  we  also 
know  that  a  is  a  subset  of  an 


It  is  my  belief  that  these  three  principles  are  ail  the 
principles  we  need  to  determine  the  epistemic  relevance  of 
statistical  knowledge  in  the  case  in  which  we  either  have  exact 
knowledge  or  none  at  all. 


6.  Inexact  Knowledge. 

By  providing  a  new  characterization  of  "difference"  among 
statistical  statements,  we  can  easily  generalize  the  above 
considerations  to  the  general  situation.  Let  two  statistical 
statements  "%{A,  B)  e  [p,  g\"  differ  from  "%{C,  D)  e  [r,  sj"  just  in 
case  neither  of  [p,  g\  nor  [r,  ^  is  included  in  the  other  Then  we 
shall  say  that  one  item  of  statistical  knowledge  is  irrelevant  to 
another  if 

(a)  it  differs,  but  is  rendered  irrelevant  by  one  of  the 
three  principles  expounded  above,  or 

(b)  it  is  less  exact  than  the  other 

Note  that  a  consequence  of  thus  liberalizing  the  notion  of 
statistical  knowledge  is  that  we  now  aalways  have  statistical 
knowledge,  even  if  it  is  only  of  the  form,  " %(  A,B )  e  [0,l]"  In 
general,  (b)  leads  us  away  from  statistical  knowledge  of  this  form 
to  more  substantive  statistical  knowledge 
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algorithm  tor  computing  the  relevant  reference  class  under 
specific  epistemic  conditions  —  re.,  with  given  (plausible) 
background  knowledge.  Since  -as  is  obvious )  potential  reference 
classes  can  proliferate  indefinitely,  we  have  not  achieved  that 
point  yet;  and  providing  an  algorithm  is  net  part  of  the  present 
project  Nevertheless,  it  should  be  clear  where  we  can  go  from 
here  Further  details  are  provided  m  Kyburg  (1983)  and  Loui 
f 1986) . 


As  an  illustration  of  the  mechanism  we  can  employ,  we 
•can  consider  the  following  construction.  Let  an  inference 
structure  for  a  statement  S’  relative  to  a  body  of  knowledge  be 
a  quintuple  < a,  3,  Q  p,  q  >,  where  "a  is  a  C"  is  known  to  be 
equivalent  to  S,  the  statement  whose  epistemic  status  interests 
us,  "a  is  a  3  "  is  known,  and  "% (Z?  C)  c  [p,  q\"  represents  the 
strongest  information  we  have  about  3  and  C 

Consider  the  set  /  of  all  inference  structures  for  5  This 
set  is  non-empty,  since  <a,  {a  },  C,  0,  1>  is  a  member  of  it, 
whatever  else  we  may  know.  We  perform  pass  number  one:  if 
an  inference  structure  differs  from  an  earlier  inference  structure 
(i.e.,  neither  [p,q]  nor  [p',q'j  is  included  in  the  other)  then 
delete  the  irrelevant  inference  structure,  if  any;  otherwise  delete 
both.  The  result  is  a  set  of  inference  structures  that  do  not 
differ  from  one  another.  They  can  be  partially  ordered  by 
inclusion,  where  we  say  that  one  inference  structure  <a,  B,  C, 
p,  q>  is  included  in  another  <a‘,  B',  C',  p',  q’>  when  [p, q]  is  a 
subinterval  of  [p',  q'] . 

We  then  perform  a  second  pass,  reflecting  our  concern  for 
information,  by  deleting  any  inference  structure  that  properly 
includes  another  inference  structure  in  the  sequence.  The  result 
is  a  set  of  inference  structures  --  it  may  well  contain  more  than 
one,  and,  according  to  the  details  of  our  procedure,  may  contain 
an  infinite  number  --  that  are  all  equally  strong.  This 
determines  both:  the  epistemic  probability  of  the  statement  in 
question,  and  its  inductive  acceptability  —  i  e  ,  acceptability  for 
non-monotonic  logic  --  as  reflected  by  the  lower  bound  of  its 
epistemic  probability 
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8.  Conclusions:  We  arrive  at  several  conclusions 

(1)  If  we  accept  the  equivalence  condition  ---  that 
statements  connnected  in  our  knowledge  base  by  a  biconditional 
should  have  the  same  probability  then  many  more  statements 
than  might  at  first  have  been  thought  can  have  probabilities 
based  on  statistical  background  knowledge. 

(2)  This  has  a  profound  bearing  on  the  representation  of 
uncertainty  in  our  bodies  of  knowledge  If  we  suppose  that 
"subjective"  confidence  in  the  strong  sense  of  "subjective"  is 
acceptable  as  a  measure  of  uncertainty  only  when  statistical 
information  is  not  available,  then  there  are  far  fewer  situations 
in  which  purely  subjective  uncertainties  are  called  for  than  some 
people  have  suggested. 

(3)  Given  the  equivalence  condition,  there  may  be  many 
potential  reference  sets  for  a  given  equivalence  class  of 
statements.  We  therefore  need  a  way  of  adjudicating  our  choice 
among  these  reference  sets. 

(4)  There  are  three  ways  in  which  conflict  between  two 
potential  reference  classes  can  be  resolved  to  the  benefit  of  one  of 
them.  Only  one  of  these  ways  seems  to  have  worked  its  way 
into  the  literature  on  non-monotonic  logic.  All  three  should  be 
taken  account  of 

(5)  These  three  resolutions  reflect  the  three  principles: 
the  Subset  Principle,  the  Bavesian  Principle,  and  the  Superset 
Principle.  (In  fact  the  subset  principle  is  reducible  to  the 
Bayesian  principle  (see  Kyburg  1983)  .) 

(6)  The  results  of  this  analysis  can  be  used  to  implement 
probabilistic  non-monotonic  acceptance  as  well  as  to  determine 
rationally  allowable  distributions  of  uncertainty. 


Henry  E  Kyburg,  Jr  University  of  Rochester 
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